Theoretical limitations of Encoder-Decoder GAN architectures

نویسندگان

Sanjeev Arora

Andrej Risteski

Yi Zhang

چکیده

Encoder-decoder GANs architectures (e.g., BiGAN and ALI) seek to add an “inference” mechanism to the GANs setup, consisting of a small encoder deep net that maps data-points to their succinct encodings. The intuition is that being forced to train an encoder alongside the usual generator forces the system to learn meaningful mappings from the code to the data-point and vice-versa, which should improve the learning of the target distribution and ameliorate mode-collapse. It should also yield meaningful codes that are useful as features for downstream tasks. The current paper shows rigorously that even on real-life distributions of images, the encode-decoder GAN training objectives (a) cannot prevent mode collapse; i.e. the objective can be near-optimal even when the generated distribution has low and finite support (b) cannot prevent learning meaningless codes for data – essentially white noise. Thus if encoder-decoder GANs do indeed work then it must be due to reasons as yet not understood, since the training objective can be low even for meaningless solutions. Though the result statement may see reminiscent in spirit to the ICML’17 paper (Arora et al., 2017), the proof is novel.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Decoupling Encoder and Decoder Networks for Abstractive Document Summarization

Abstractive document summarization seeks to automatically generate a summary for a document, based on some abstract “understanding” of the original document. State-of-the-art techniques traditionally use attentive encoder–decoder architectures. However, due to the large number of parameters in these models, they require large training datasets and long training times. In this paper, we propose ...

متن کامل

An Encoder-Decoder Based Convolution Neural Network (CNN) for Future Advanced Driver Assistance System (ADAS)

We propose a practical Convolution Neural Network (CNN) model termed the CNN for Semantic Segmentation for driver Assistance system (CSSA). It is a novel semantic segmentation model for probabilistic pixel-wise segmentation, which is able to predict pixel-wise class labels of a given input image. Recently, scene understanding has turned out to be one of the emerging areas of research, and pixel...

متن کامل

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Scene Segmentation

We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG...

متن کامل

Architecture for Programmable Generator

395 Abstract—Reed-Solomon Codes are popularly used for error correction in many applications like storage devices (CD, DVD), wireless communications, high speed modems and satellite communications. In this paper, a modified scheme for programmable generator polynomial based Reed-Solomon encoder and decoder has been proposed. The works reported in this paper corrects errors in derived equations ...

متن کامل

Advanced Hash-based Distributed Video Coding

Distributed video coding (DVC), also known as Wyner-Ziv video coding, provides low-complexity encoding solutions for video. In contrast to traditional predictive coding systems, which capture the temporal redundancies at the encoder side, this task is transferred to the decoder. This radical shift of the computational burden towards the decoder brought in DVC is particularly attractive for reco...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1711.02651 شماره

صفحات -

تاریخ انتشار 2017

Theoretical limitations of Encoder-Decoder GAN architectures

نویسندگان

چکیده

منابع مشابه

Decoupling Encoder and Decoder Networks for Abstractive Document Summarization

An Encoder-Decoder Based Convolution Neural Network (CNN) for Future Advanced Driver Assistance System (ADAS)

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Scene Segmentation

Architecture for Programmable Generator

Advanced Hash-based Distributed Video Coding

عنوان ژورنال:

اشتراک گذاری